Estimating the effect of corporate integrity culture on tax avoidance using a text-based approach: A research note

Tax avoidance holds immense importance due to its substantial implications for government revenues and the fair allocation of resources. Consequently, understanding the factors that shape tax avoidance is critically important. Exploiting a cutting-edge measure of corporate integrity derived from state-of-the-art machine learning algorithms and textual analysis, we explore the effect of corporate integrity on tax avoidance. Our text-based measure is based on a textual analysis of earnings conference call transcripts. Our findings show that companies with greater corporate integrity are significantly less involved in tax avoidance. Further analysis corroborates the results, i.e., propensity score matching, entropy balancing, and an instrumental variable analysis. Our findings are especially noteworthy as they demonstrate that corporate culture, although intangible in nature, exerts a substantial influence on corporate behavior.


I. Introduction
Tax avoidance is a critically important issue due to its significant impact on government revenues and the equitable distribution of resources.Addressing this issue is essential for promoting a fair and sustainable economic system.Not surprisingly, tax avoidance has generated an immense volume of research in accounting, finance, economics, and other areas (see [1] for a literature review).We extend the body of knowledge in this area by investigating how corporate tax avoidance is influenced by one of the most important corporate cultural traits, i.e., corporate integrity.
Corporate culture can be defined as the collective set of beliefs, values, and preferences shared by employees within a corporation [2,3].Corporate culture is abstract and is difficult to quantify empirically.However, recently, Li et al. [4] employ advanced machine learning algorithms and sophisticated textual analysis of earnings conference calls to identify corporate cultural traits.Using Li et al.'s [4] text-based approach, we explore how a culture of corporate integrity influences corporate tax avoidance.
Firms with a strong culture of integrity should be less involved in corporate tax avoidance.Such firms prioritize transparency, compliance with laws, and ethical decision-making, which extends to their tax practices.Corporate tax avoidance often involves exploiting legal loopholes and engaging in aggressive tax planning to minimize tax liabilities, sometimes at the expense of society's broader interests.Companies that prioritize integrity are more likely to adhere to the spirit of the tax laws and contribute their fair share of taxes to the society in which they operate [4].Furthermore, firms with a strong culture of integrity tend to value long-term sustainability and the preservation of their reputation.Engaging in aggressive tax avoidance can be detrimental to a company's image and could lead to reputational damage if such practices are exposed or criticized by the public, shareholders, or other stakeholders.Therefore, we hypothesize that firms with a stronger culture of integrity exhibit less tax avoidance.
Using a large sample of U.S. firms and a distinctive measure of corporate integrity derived from sophisticated machine learning algorithms and textual analysis, we show that greater corporate integrity results in a significant reduction in tax avoidance, corroborating our hypothesis.To mitigate endogeneity, we perform a variety of robustness checks, i.e., propensity score matching, entropy balancing, and an instrumental variable analysis.All robustness checks validate the results.
Our results extend the existing literature in several important ways.First, we extend the literature on corporate culture.Previous research has predominantly examined corporate culture from a theoretical perspective [2][3][4][5][6].However, empirical investigations on corporate culture have been limited.Our study addresses this gap and stands as the first empirical research to document that a culture of integrity influences corporate tax avoidance significantly.
In addition to our contribution to the literature on corporate culture, we also enrich the existing research on tax avoidance.Prior studies have primarily focused on accounting or financial factors as determinants of tax avoidance ( [1], provide a comprehensive review).Our study aptly augments this body of knowledge by demonstrating that corporate culture, a nonfinancial and abstract attribute, significantly influences corporate tax avoidance.Finally, we contribute to an emerging area of empirical research that employs textual analysis (see [7], for a literature review).We show that textual analysis can be used to extract abstract quality and create metrics that are empirically relevant and useful.

II. Prior research on tax avoidance
Our research belongs to a crucial stream of literature that investigates the determinants of tax avoidance.This area holds significant importance and has given rise to an extensive body of research.Early studies primarily concentrated on intrinsic corporate attributes such as company size and operational strategies [1,[8][9][10].By contrast, contemporary tax avoidance research has integrated corporate governance attributes aimed at mitigating agency conflicts [1,11].For instance, according to McGuire et al. [12], firms employing dual class share structures tend to exhibit a reduced inclination towards tax avoidance.This could be because insiders hold control over voting rights, potentially alleviating the pressure on management from external shareholders to partake in tax avoidance practices.Richardson et al. [13] find a decline in tax avoidance when the internal audit committee consists of more outside independent directors.External governance mechanisms, such as media exposure, are also relevant to tax avoidance.As discussed in Tian et al. [14]and Kanagaretnam et al. [15], lower levels of tax avoidance are documented when there is strong media exposure of aggressive tax behaviors.
Product market considerations also play a role in tax avoidance.For instance, Kubick et al. [16] highlight that companies leading in the product market tend to employ greater tax avoidance strategies, leveraging their comparative advantage.Conversely, entities possessing valuable brands tend to exhibit lower levels of tax avoidance due to their concerns about maintaining a positive reputation [17].Several other factors also influence the extent of tax avoidance.Wang et al. [1] offer a contemporary and complete literature review of tax avoidance.
While it is intuitive to assume that corporate integrity is relevant to tax avoidance, there is surprisingly scant research on this issue.One of the reasons is that it is challenging to capture corporate integrity.While corporate integrity is a straightforward and simple concept, it is difficult to operationalize it empirically.We address this gap in the literature by exploiting an innovative measure of a corporate culture of integrity based on advanced algorithms and textual analysis.

III. Sample selection and data description
Our original sample comes from Li et al. [4].Then, we combine the data with COMPUSTAT to obtain firm-specific characteristics and financial statement data necessary to estimate tax avoidance measures.The final sample comprises 41,138 firm-year observations from 2001 to 2018.
The measure of corporate integrity used in our study is derived from Li et al. [4], who employ a sophisticated approach to extract phrases pertaining to corporate integrity from earnings conference call transcripts.The frequency of appearance of these phrases serves as an indicator of the level of integrity possessed by the firm.Li et al. [4] execute a variety of validation tests and conclude that their approach is reliable and useful.More information about the construction of the text-based corporate culture score, of which corporate integrity is an important component, is provided in the S1 Appendix.
For tax avoidance, we utilize several alternative measures for robustness, i.e., (1) cash effective tax rate (cash ETR), (2) GAAP effective tax rate (GAAP ETR), (3) book-tax-differences (BTD) and (4) permanent book-tax-differences (Perm-diff).These measures of tax avoidance are widely used in the literature (De Simone et al., 2020).We multiply the cash ETR and the GAAP ETR by minus one in the regression analysis for ease of interpretation.Therefore, a higher value of each of our measures indicates greater tax avoidance.
Additionally, we combine all the four measures discussed above into a single index using principal component analysis (PCA).PCA allows researchers to transform their original variables into a new set of uncorrelated variables, known as principal components.These components are ordered by their ability to explain the variance in the data, with the first component capturing the most variance and subsequent components capturing decreasing amounts.By reducing the dimensionality of our data while retaining the most important information, PCA helps identify the underlying structure and relationships among variables.By focusing on what the four different measures share, we can reduce errors considerably.We referred to the combined measure, which is the first component resulting from PCA, as the tax avoidance index.
We incorporate several firm-specific attributes that may affect tax avoidance.Specifically, we include controls for firm size (natural logarithm of total assets), profitability (EBIT divided by total assets), leverage (total debt divided by total assets), capital investments (capital expenditures/total assets), cash holdings (cash holdings divided by total assets), intangible assets (research and development (R&D) expenses divided by total assets, and advertising expenses divided by total assets), asset tangibility (fixed assets divided by total assets), dividend payouts (total dividends divided by total assets), and discretionary spending (selling, general, and administrative (SG&A) expenses divided by total assets).Additionally, we include industry and year fixed effects to account for variations across industries and over time.It is difficult to include firm fixed effects in the context of our study as corporate culture changes slowly over time, making it challenging to incorporate firm fixed effects.Table 1 shows the summary statistics for the variables.
Essentially, we run the following OLS regression analysis with industry and year fixed effects: where i indexes firms and t indexes years.

IV. Results
The results are presented in Table 2. Standard errors are clustered by firm.Model 1 has the cash ETR as the dependent variable.The coefficient associated with corporate integrity is significantly positive at the 10% level, suggesting that corporate integrity results in less tax avoidance.The dependent variable in Model 2 is the GAAP ETR, where the coefficient of corporate integrity is insignificant.Model 3 and Model 4 have book-tax differences and permanent book-tax differences as dependent variables respectively.The coefficients of corporate integrity in Model 3 and Model 4 are significantly negative, implying that greater corporate integrity diminishes tax avoidance.Finally, we use the combined measure of tax avoidance in Model 5 and obtain a significantly negative coefficient for corporate integrity.Overall, our results demonstrate a significant decline in tax avoidance in the presence of higher corporate integrity, consistent with our hypothesis.
To minimize endogeneity, we run several robustness checks and show the results in Table 3. First, we perform propensity score matching (PSM).We classify firms in the top quartile of corporate integrity as our treatment group.For each observation in the treatment group, we select an observation in the rest of the sample that is most similar using ten firm-specific attributes (the ten control variables in the regression analysis).Hence, our treatment and control groups are indistinguishable in every observable dimension, except for corporate integrity.Model 1 in Table 3 contains the PSM result, showing a significantly negative coefficient for corporate integrity.In Model 2, we execute entropy balancing, where we adjust the weight of each observation such that the means and the variances of the treatment and control groups are comparable.Again, the coefficient of corporate integrity is significantly negative.Furthermore, we implement an instrumental-variable analysis (IV).Our first instrument is the value of corporate integrity in the earliest year for each firm.Since corporate integrity in the earliest year could not have resulted from corporate integrity in any of the subsequent years, reverse causality is mitigated.We conduct an instrumental-variable analysis in two stages.Initially, we regress the corporate integrity score on our instrumental variable and all control variables, subsequently saving the predicted value of the corporate integrity score.In the second stage, we regress tax avoidance on this predicted value and the control variables.This approach aligns with the standard procedure for instrumental variable analyses.Model 3 is the second-stage regression result, where the coefficient of corporate integrity instrumented from the first stage is significantly negative.The Shea partial R 2 for our first-stage regression stands at 30.89%, and with an F-statistic of 14,907.08, it is clear that our instrument is robust and statistically significant.
Finally, we employ another instrumental variable.Due to investors' clientele, local competition, and social interactions, companies located nearby tend to share similar characteristics, including corporate culture [18].Our second instrumental variable is the average value of corporate integrity of all firms located in the same city.The value at the city level should influence corporate integrity at the firm level but is not directly correlated with firm-specific tax avoidance because there are many firms in the same city.Again, we carry out a two-stage instrumental-variable analysis.First, we regress the corporate integrity score against our instrumental variable and control variables, then save the predicted score.In the second stage, we regress

V. Conclusions
We hypothesize that companies with a stronger culture of integrity should be less involved in tax avoidance.Based on a unique measure of corporate integrity generated by cutting-edge machine learning algorithms and textual analysis, our findings show that greater corporate integrity brings about a significant reduction in corporate tax avoidance, corroborating our hypothesis.Additional robustness checks validate the results, i.e., propensity score matching, entropy balancing, and an instrumental variable analysis.Our findings are crucially important as they demonstrate the tangible influence of corporate culture on corporate behavior, despite its abstract nature.
Our research findings carry significant practical implications for diverse stakeholders.First, shareholders and managers gain insights into the significance of corporate culture, despite its abstract nature, as it correlates with corporate behavior.Consequently, promoting corporate integrity becomes imperative.Secondly, regulators and policymakers can utilize our findings to consider the impact of corporate culture while formulating tax-related regulations.This understanding can lead to more effective and targeted policy measures.Thirdly, investors benefit from our research by recognizing the relevance of corporate culture in influencing corporate behavior.This awareness enables them to make more informed and accurate assessments of companies.Lastly, tax authorities can leverage our findings to make well-informed decisions on reducing tax avoidance.In summary, our study provides actionable insights for stakeholders to foster corporate integrity, create effective tax policies, make informed investment decisions, and address tax avoidance challenges.
Finally, a couple of limitations of our research can be noted.First, corporate integrity has a broad meaning.We use a text-based measure of corporate integrity culture as a proxy for corporate integrity.While our measure is useful in capturing the extent of corporate integrity, some aspects of corporate integrity may be left out.One way to address this limitation is for future research to utilize other proxies for corporate integrity that may reflect other aspects of corporate integrity.Second, while we employ several proxies for tax avoidance for robustness, there are a few other proxies that are not included.Future researchers may extend our study by including additional measures of tax avoidance.For instance, recently, a new measure of tax avoidance called "uncertain tax positions" has been examined in the literature.This new measure is enabled by the Accounting Standard Codification (ASC) 740.It would be interesting for future research to explore the impact of corporate integrity using this new proxy for tax avoidance.

Table 3 .
Robustness checks.://doi.org/10.1371/journal.pone.0298528.t003tax avoidance against this predicted score and control variables.Model 4 shows the secondstage regression, where the coefficient of corporate integrity is significantly negative.The firststage regression has a Shea partial R 2 of 30.89% and a highly statistically significant F-statistic of 14,907.08,confirming the robustness of our instrument.Overall, there is robust evidence that companies with stronger corporate integrity are significantly less aggressive in avoiding taxes, corroborating our hypothesis. https